Corpus-based metrics for assessing communal common ground
نویسندگان
چکیده
This article presents the first attempt to construct a computational model of common ground. Four corpus-based metrics are presented that estimate what facts are likely to be in common ground. The proposed metrics were evaluated in an experiment with human participants, focussing on a domain of famous people. The results are encouraging: two of the proposed metrics achieved a large positive correlation between the estimates of how widely known a property of a famous person is and the percentage of participants who knew the corresponding property.
منابع مشابه
Content Selection Challenge - University of Aberdeen Entry
Bouayad-Agha et al. (2012) issued a content determination challenge in which researchers were asked to create systems that can automatically select content suitable for a first paragraph in a Wikipedia article from an RDF knowledge base of information about people. This article is a description of the system built at the University of Aberdeen. Our working assumption is that the target text sho...
متن کاملThe suitability of common metrics for assessing parotid and larynx autosegmentation accuracy
Contouring structures in the head and neck is time-consuming, and automatic seg-mentation is an important part of an adaptive radiotherapy workflow. Geometric accuracy of automatic segmentation algorithms has been widely reported, but there is no consensus as to which metrics provide clinically meaningful results. This study investigated whether geometric accuracy (as quantified by several comm...
متن کاملGround Truth, Reference Truth & “Omniscient Truth” -- Parallel Phrases in Parallel Texts for MT Evaluation
Recently introduced automated methods of evaluating machine translation (MT) systems require the construction of parallel corpora of source language (SL) texts with human reference translations in the target language (TL). We present a novel method of exploiting and augmenting these resources for task-based MT evaluation, assessing how accurately people can extract Who, When, and Where elements...
متن کاملCombined Mapping of Multiple clUsteriNg ALgorithms (COMMUNAL): A Robust Method for Selection of Cluster Number, K
In order to discover new subsets (clusters) of a data set, researchers often use algorithms that perform unsupervised clustering, namely, the algorithmic separation of a dataset into some number of distinct clusters. Deciding whether a particular separation (or number of clusters, K) is correct is a sort of 'dark art', with multiple techniques available for assessing the validity of unsupervise...
متن کامل3d Landscape Metrics to Modelling Forest Structure and Diversity Based on Laser Scanning Data
This paper investigates the potential of laser scanning data to model forest 3d structure and its spatial pattern. Most of the existing methods for assessing structures in mountain forests are either inventory methods, which cannot be used for spatial assessments over large areas, or methods aimed only at assessing actual wood production. Several new landscape metrics are developed and applied ...
متن کامل